Learning To Predict A Student's Performance On Problem Sets
نویسنده
چکیده
Project Goal The goal of this project is to show that it is possible to create a system that can effectively predict whether a student will get a problem right or wrong, given the performance of other students on a set of problems which contains the target problem and given the students performance on the same set of problems minus the target problem. The reason I would find such a system useful is that I plan on making a web site that generates dynamic problems sets for my senior project. The idea is that the web site would use students history on problems to create customized optimal sets of problems for learning a particular topic. Predicting the probability that a student gets a problem right is key to developing such a system. Data For this project the Grade 7 math problems from the California SAT9 exam were used as the problem set. The data is composed of over 400,000 students, each of whom have taken 80 problems. For the predictive system to be effective, it must accurately estimate the probability of a students success on a problem, as opposed to predicting a binary value for success or failure. This is because, it would be useful when creating dynamic problem sets, to be able to specify how challenging a problem set should. Thus rather than just calculating the average number of times a prediction is wrong, error should measure the standard deviation of the difference between the success and the predicted probability of success. A measure of average absolute value of differences promotes a binary prediction while using sum of squared errors promotes an accurate estimate of probability. The measure of error therefore will be: error = sqrt (∑i:m∑j:n (yHat ij – y ij))/(m*n)) where yHat is the predicted probability of getting target problem right, and y ij is success on the ith problem on the jth test (1 if correct, 0 if wrong). The error is summed over n, because each of the n problems is used as a target and over m for each of the m students in the sample. For frame of reference the error generated by simply guessing the mean success of the target problem over all students in the training set is shown in the table below: (Note: The fact that the generalized error was smaller for the smallest training sets is idiosyncratic to the …
منابع مشابه
Ensemble-based Top-k Recommender System Considering Incomplete Data
Recommender systems have been widely used in e-commerce applications. They are a subclass of information filtering system, used to either predict whether a user will prefer an item (prediction problem) or identify a set of k items that will be user-interest (Top-k recommendation problem). Demanding sufficient ratings to make robust predictions and suggesting qualified recommendations are two si...
متن کاملMMDT: Multi-Objective Memetic Rule Learning from Decision Tree
In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider accuracy and interpretation of rules sets. Additionally, individual classifiers face other problems such as huge sizes, high dimensionality and imbalance classes’ distribution data sets. This...
متن کاملThe Effects of Learning Self-Regulated Learning Strategies on High School Students\' Math Problem-Solving Skills
The Effects of Learning Self-Regulated Learning Strategies on High School Students' Math Problem-Solving Skills B. Shaalchi, Ph.D. Y. Rezaapoor J. Baabaapoor Kheyrodin, Ph.D. M. Alizaad Sardroodi F. Abdollaahi To see the effects of learning self-regulated learning strategies on math problem-solving skills among high school students, a sample of 271 randomly selected h...
متن کاملپیشبینی پیشرفت تحصیلی دانشآموزان دبیرستانی براساس باورهای خود کارآمدی، وضعیت اجتماعی- اقتصادی و رشته تحصیلی
Social-cognitive theories on motivation and academic achievement emphasize the interactive effects of Personal and contextual variables on the students' academic achievement. The main purpose of the present study was to investigate the predictive role of self- efficacy and socio-economic status as two important personal and contextual variables on student's academic achievement. 500 stuents...
متن کاملSports Result Prediction Based on Machine Learning and Computational Intelligence Approaches: A Survey
In the current world, sports produce considerable statistical information about each player, team, games, and seasons. Traditional sports science believed science to be owned by experts, coaches, team managers, and analyzers. However, sports organizations have recently realized the abundant science available in their data and sought to take advantage of that science through the use of data mini...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008